Summary



The aim of the work described in this thesis was to study the structure-function 
properties of the human cytokine, granulocyte-macrophage colony-stimulating factor 
(GM-CSF) in order to generate molecules with novel biological properties. The approach 
used was to generate mutated forms of GM-CSF cDNA, express mutant proteins and 
examine their biological activity and receptor binding properties. 

At the time the work for this thesis commenced the structure of GM-CSF had not 
yet been determined. However, the predicted structure of GM-CSF was a bundle of four 
alpha helices containing hydrophilic regions. Initial studies using COS and CHO cell 
expression systems focussed on residues within the first predicted helix of GM-CSF that 
contribute to a prominent hydrophilic peak. Deletion analysis and substitution 
mutagenesis of residues 20 and 21 indicated that Glu 21 was a functionally significant 
residue as charge reversal mutations at this position reduced GM-CSF activity 300-fold. 
A variety of amino acid substitutions at position 21 indicated a hierarchy of tolerance 
with Glu, Asp > Asn > Ala > Ser > Gln > Lys, Arg. The results indicated that Glu 21 was
essential for multiple GM-CSF activities including cell proliferation and mature cell 
activation. 

The GM-CSF receptor comprises a GM-CSF-specific, low affinity receptor α
chain (GMRα) and a β chain (βc) that does not by itself detectably bind GM-CSF but
confers high affinity binding when co-expressed with the α chain and is required for
signal transduction. The βc chain is shared with the IL-3 and IL-5 receptors. Receptor
binding studies with residue 21 analogues indicated that Glu 21 is essential for binding to
the high affinity (GMRαβc) but not the low affinity (GMRα) receptor. The results
identified the presence of two functional domains of GM-CSF required for either GMRα
or GMRαβc interaction and demonstrated that GM-CSF stimulation of both proliferation
and mature cell activation is mediated through high affinity receptors.

The functional role of other residues in the first alpha helix was examined in light
of the critical role determined for Glu 21. Oligonucleotide cassette mutagenesis (OCM)
was developed to generate large numbers of mutants for expression in COS cells.
Residues 14 and 17 to 28 of the first alpha helix of human GM-CSF were subjected to
extensive substitution mutagenesis. Mutation of most amino acids buried in the
hydrophobic core did not significantly impair biological activity or receptor binding apart
from a modest decrease in biological activity observed with mutation of residues Ala 22
and Leu 26. Mutation of Ile 19 produced a marked decrease in biological activity and
receptor binding, probably as a result of structural perturbations. Mutation of amino
acids located on the surface of the first helix did not significantly impair biological
activity or receptor binding, with the notable exception of Glu 21 (discussed above). The
conclusion was that residue 21 is the only significant βc chain contact on the surface of
the first helix.

The carboxy terminus of GM-CSF contains a prominent hydrophilic peak centred 
over the fourth alpha helix with a number of charged amino acids conserved across 
several species. The importance of these residues was examined by charge reversal 
mutagenesis and indicated a role in GM-CSF biological activity for Asp 112. Binding
studies showed a reduction in high- and low-affinity binding for the D112R analogue. In
this respect residue 112 appears to be functionally distinct from residue 21 and is
apparently involved in binding to the GMRα chain.

An E. coli expression system was used to produce larger quantities of GM-CSF 
residue 21 analogues for detailed receptor binding studies and to enable structural 
characterisation. The analogues, which included hydrophobic (Ala, Phe), hydrophilic
(Gln) and basic (His, Lys, Arg) substitutions at position 21, proved refractory to
expression, but techniques were devised that enabled their expression and purification.



The E. coli-derived GM-CSF residue 21 analogues displayed a range of biological
activities, yet they all exhibited low affinity binding characteristics on both GMRα and
GMRαβc, similar to CHO cell-derived analogues. The E21R and E21K charge reversal
analogues were surprisingly devoid of activity. The lack of activity from the E. coli-derived
E21R analogue was in marked contrast to the activity observed for the CHO cell-derived
E21R analogue. These results demonstrated that although in some cases low
affinity binding can lead to biological signalling, in other cases such as with the E21R 
and E21K analogues, low affinity binding can be dissociated from receptor activation 
and biological signalling. 

The E. coli-derived E21R and E21K analogues, which bind GMRα chain without
eliciting a functional response, were tested for antagonistic activity. Both analogues were
effective antagonists of the GM-CSF-mediated proliferation of leukaemic cells and the
GM-CSF-mediated release of superoxide anions from neutrophils. The antagonism was
effective against glycosylated and non-glycosylated forms of GM-CSF and was specific
for GM-CSF in that no antagonism of IL-3-mediated leukaemic cell proliferation or
TNF-α-mediated neutrophil superoxide production was observed.

On the basis of the observations presented in this thesis, a model for the interaction 
of GM-CSF with the receptor complex is proposed that should also be applicable to 
related cytokines, IL-3 and IL-5 in particular. The critical feature of this model is that 
GM-CSF contains two functionally distinct receptor binding sites, one for the GMRα
chain and another for the βc chain. The GMRα chain binding site includes residues from
the fourth helix and in particular the Asp 112 residue. The βc chain binding site includes
only Glu 21 of the first helix. A model to account for the antagonism of GM-CSF activity
is presented which proposes that the antagonist sequesters the available GMRα chain
into complexes unable to associate with the βc chain and therefore prevents wild type
GM-CSF interaction with the GM-CSF receptor complex. Possible clinical implications 
for GM-CSF antagonists are discussed. 






EXHIBIT B 



Proc. Natl. Acad. Sci. USA 
Vol. 86, pp. 4872-4876, July 1989 



Identification of critical regions in mouse granulocyte-macrophage 
colony-stimulating factor by scanning-deletion analysis 



([...]/structure-function/mutagenesis)



Armen B. Shanafelt and Robert A. Kastelein

Department of Molecular Biology, DNAX Research Institute of Molecular and Cellular Biology, 901
California Avenue, Palo Alto, CA 94304-1104

Communicated by Charles Yanofsky, March 29, 1989 (received for review February 28, 1989)



ABSTRACT

[...] examined by generating a series of [...]
entire length of the molecule. Deletions of three amino acids
[...] expressed in Escherichia coli and assayed for biological activity.
This procedure identified four regions critical to activity. These
critical regions were further delineated by additional three-amino
acid deletion mutants. Larger deletions at each terminus
were also made, as well as changes of specific amino acid
residues. The four critical regions span amino acid residues
18-22, 34-41, 52-61, and 94-115. The disulfide bridge between
Cys-51 and Cys-93 was also shown to be essential for
activity, whereas that between Cys-85 and Cys-118 could be
removed without loss of activity. The possible structural
and/or functional roles of the critical regions are discussed.

The proliferation and differentiation of granulocytes and 
macrophages from progenitor bone marrow cells is mediated 
by a group of protein hormones known as the colony- 
stimulating factors (1). Four distinct colony-stimulating fac- 
tors have been identified in both mouse and human; one of 
these is granulocyte-macrophage colony-stimulating factor 
(GM-CSF). It is produced by various cell types, such as T 
lymphocytes and monocytes (2), and has multiple activities in 
vitro. It not only stimulates the formation of granulocytes and 
macrophages from bone marrow progenitor cells (1) but also 
can activate mature eosinophils, neutrophils, and macro- 
phages (3). 

Both mouse (m) and human (h) GM-CSF have been char- 
acterized by cloning, sequencing, and expression (4, 5). The 
mGM-CSF polypeptide (124 residues; Mr, 14,346) is 54%
identical to the hGM-CSF polypeptide (127 residues; Mr,
14,650). Despite the high degree of homology, the two 
polypeptides are species specific (5). 

Limited information is available on structure-function 
relationships for GM-CSF. Proteolytic digestion of both 
proteins identified two disulfide bonds, linking the first and 
third, and second and fourth cysteine residues (6). Using 
chemically synthesized hGM-CSF protein fragments and 
analogs, Clark-Lewis et al. (7) identified residues 1-13 and 
residues 122-127 as being not critical and residues 14-25 as 
being critical for biological activity. Mutagenesis of mGM- 
CSF by Gough et al. (8) revealed that residues 11-15, 24-37, 
47-49, and 81-89 are obligatory for function. Physical and 
biochemical studies indicate that both the murine and human 
proteins have similar secondary structures containing
α-helical and β-sheet segments (9).

In this report we describe an analysis of a set of deletion
mutants that together span the entire length of mGM-CSF. 
Characterization of 42 deletion proteins, expressed in Esch- 
erichia coli, has allowed us to identify and delineate four 
regions within mGM-CSF that are critical to the activity of 
the protein. Additional deletion and substitution mutants 
further define the importance of these and other regions of the 
molecule. 

MATERIALS AND METHODS 

Bacterial Host Strains and Vectors. The E. coli K12 strain 
JM101 (10) was used as host for the propagation and main- 
tenance of M13 DNA. CJ236 (11) was used to prepare 
uracil-DNA for use in site-directed mutagenesis. AB1899 (12)
was used as the host for expression of wild-type and mutant
mGM-CSF. pINIIIompA2 (13) was used as the expression
vector for all mGM-CSF genes. Elsewhere, we have de- 
scribed the expression of biologically active, mature GM- 
CSF with this E. coli secretory expression system (14). The 
mGM-CSF coding region was synthetically reconstructed 
using a series of oligonucleotides. Unique restriction sites 
were introduced where possible without altering the protein 
sequence and high-bias E. coli codons (15) were preferentially
used. From this construct, an Xba I-BamHI fragment
containing the ompA leader sequence and the entire mGM-CSF
gene was cloned in M13mp19 (replicative form) and used
as the template for site-directed mutagenesis. 

Mutagenesis, Recombinant DNA, and Sequencing Protocols. 
Site-directed mutagenesis followed the protocol described by 
Kunkel et al. (11). Oligonucleotides, 20-28 nucleotides long, 
corresponding to mGM-CSF sequences incorporating the 
desired deletion or mutation were made complementary to 
the template DNA and used as primers in the mutagenesis 
reactions. Three (to five) plaques from each reaction were 
sequenced using the dideoxynucleotide method (16) with 
modifications described in the Sequenase (United States 
Biochemical) protocol. Where possible, the entire GM-CSF 
coding sequence was examined. 

M13 (replicative form) DNA containing correct mutations 
was cleaved with Xba I and BamHI (New England Biolabs).
The digested M13 (replicative form) was combined in a 40-μl
ligation mixture containing 0.2 μg of pINIIIompA2 (cleaved
with Xba I and BamHI) and incubated at 12°C for 4 hr. The
ligation mixture was used to transform JM101 cells, and
selection was made on Luria broth (LB) plates containing
ampicillin at 50 μg/ml. Correct clones were transformed into
AB1899 for protein expression. 

Gel Electrophoresis and Immunoblotting. SDS/polyacryl- 
amide gel electrophoresis (SDS/PAGE) was performed as 
described by Laemmli (17). Immunoblotting was performed 
essentially as described by Towbin et al. (18), except that
transfer occurred at 4°C for 16 hr at 30 V. The primary
antibody was anti-mGM-CSF peptide hybridoma mgl.8.2
used at a 1:100 dilution of culture supernatant (19). Secondary
antibody was 125I-labeled mouse anti-rat IgG (Amersham)
used at a 1:1000 dilution. Dried blots were exposed to Fuji RX
film with an intensifying screen for 12-16 hr at -80°C.

Abbreviations: GM-CSF, granulocyte-macrophage colony-stimulating factor [...]

Preparation and Quantitation of Protein Extracts. Five 
milliliters of LB containing ampicillin at 50 μg/ml were
inoculated with 100 μl of medium from individual overnight
cultures of AB1899 harboring the wild-type or a mutant
expression plasmid. The cultures were grown to
mid-logarithmic phase at 37°C, and protein expression was
induced for 4 hr with 2 mM isopropyl β-D-thiogalactopyranoside,
final concentration. Three milliliters of culture were
centrifuged in a microcentrifuge (American Scientific Products,
Stone Mountain, GA) for 5 min, and the pellet was
resuspended in 500 μl of 10 mM sodium phosphate/150 mM
NaCl, pH 7.0. The cell suspension was then lysed by soni- 
cation in a Branson Cell Disrupter 200 sonifier with 30 0.5-sec 
pulses at 40 W. SDS (0.1%, final concentration) was added to 
the lysates, and the mixtures were incubated for 5 min at 
room temperature and then centrifuged for 5 min in the 
microcentrifuge. The supernatant was recovered, frozen, 
then thawed, and centrifuged for 5 min in the microcentri- 
fuge. Also, osmotic shock fractions of some of the mutants 
were prepared as described by Koshland and Botstein (20). 
This procedure does not require SDS. No difference in 
relative activity could be detected for mutant proteins pre- 
pared according to either procedure. 

Aliquots of samples were analyzed by SDS/PAGE and 
Western immunoblotting. Autoradiograms were scanned by 
an LKB Ultroscan XL laser densitometer, and the concen- 
tration of wild-type and each mutant mGM-CSF protein was 
determined by comparing the area corresponding to the 
processed form of each protein with a standard curve gen- 
erated with purified E. coli-derived recombinant mGM-CSF.
A lane containing 30 ng of purified mGM-CSF was included 
on each gel for use as an internal calibration standard. The 
variation in protein concentration was found to be 3- to 5-fold 
based on analysis of extracts from four independent clones of 
the synthetic mGM-CSF construct expressed in AB1899 
cells. 
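
The quantitation step above reads a band area off a standard curve. A minimal sketch of that lookup follows, assuming piecewise-linear interpolation between standards; the paper does not say how the curve was fitted, and the function name and every number below are hypothetical placeholders, not measured values.

```python
def interpolate_amount(area, standards):
    """Estimate protein amount (ng) from a densitometry band area.

    standards: (ng, area) pairs from purified protein lanes.
    Piecewise-linear interpolation is an assumption of this sketch.
    """
    pts = sorted(standards, key=lambda p: p[1])  # order by band area
    for (ng0, a0), (ng1, a1) in zip(pts, pts[1:]):
        if a0 < a1 and a0 <= area <= a1:
            return ng0 + (ng1 - ng0) * (area - a0) / (a1 - a0)
    raise ValueError("band area lies outside the standard curve")


# Hypothetical standards: (ng of purified protein, scanned band area).
standards = [(10.0, 1.0), (30.0, 3.0), (60.0, 6.0)]
estimate = interpolate_amount(2.0, standards)
print(f"estimated amount: {estimate:.1f} ng")
```

Areas outside the range of the standards raise an error rather than being extrapolated, since a densitometric response cannot be assumed linear beyond the calibrated range.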

NFS-60 Proliferation Assay for mGM-CSF Activity. Ex- 
tracts were assayed using the myeloid cell line NFS-60. 
Sample concentrations of mGM-CSF protein were adjusted 
to 36 ng/ml and titrated in quadruplicate to 0.2 pg/ml. The 
3-[4,5-dimethylthiazol-2-yl]-2,5-diphenyltetrazolium bro- 
mide (MTT) colorimetric assay described by Mosmann (21) 
was used to measure the extent of proliferation, and absor- 
bance values were read with a kinetic microplate reader 
(Molecular Devices, Palo Alto, CA). The concentration of 
each mutant and wild-type mGM-CSF that gave 50% maxi- 
mum response was determined, and relative activity was 
calculated using the relationship:
$\%\ \text{activity} = ([\text{wild-type}]_{1/2} / [\text{mutant}]_{1/2}) \times 100\%$,
where $[\text{wild-type}]_{1/2}$ and $[\text{mutant}]_{1/2}$ are the
concentrations of wild-type and mutant
mGM-CSF proteins, respectively, that gave 50% maximum
response in the NFS-60 assay.
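
As an illustration of the relative-activity relationship, the short sketch below computes percent activity from two half-maximal concentrations. The function name and the example concentrations are hypothetical and are not taken from the paper.

```python
def relative_activity(wild_type_half_max, mutant_half_max):
    """Percent activity = ([wild-type]_1/2 / [mutant]_1/2) x 100.

    Both arguments are the concentrations giving 50% maximum response,
    expressed in the same units.
    """
    if wild_type_half_max <= 0 or mutant_half_max <= 0:
        raise ValueError("half-maximal concentrations must be positive")
    return wild_type_half_max / mutant_half_max * 100.0


# Hypothetical values in pg/ml, chosen only to show the direction of the ratio.
wt = 10.0
weak = relative_activity(wt, 1000.0)   # mutant needs 100x more protein
strong = relative_activity(wt, 2.0)    # mutant needs 5x less protein
print(f"weak mutant: {weak:.2f}%  strong mutant: {strong:.0f}%")
```

A mutant that needs more protein than wild-type to reach the half-maximal response scores below 100%, and one that needs less scores above 100%.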

RESULTS 

Systematic Introduction of Deletions Throughout mGM- 
CSF. The structural and functional importance of specific 
regions within the mGM-CSF polypeptide were determined 
by the systematic introduction of deletions along the entire 
length of the molecule (Fig. 1). Deletions of three amino acids 
were introduced every five amino acids on a synthetically 
generated mGM-CSF DNA sequence by site-directed muta- 
genesis. The positions of certain deletions (V10 and V17) were 
chosen to preserve cysteine residues.

Fig. 1. Location of deletions. The primary amino acid sequence
of mGM-CSF is shown with every 10th amino acid in boldface type.
Each shaded box contains the amino acids deleted for a given mutant;
the numerals below each box identify the respective mutant.

In all cases the altered
mGM-CSF coding regions were expressed using the plasmid
vector pINIIIompA2 (13). The leader sequence of the ompA 
gene was fused to the mGM-CSF mature sequence, directing 
the mGM-CSF protein to the E. coli periplasm. Here the 
leader sequence is removed, generating mature GM-CSF 
(14). The E. coli lon- host AB1899 was used to minimize
proteolysis in the hope that similar expression levels would 
be obtained for each mutant protein (12). The amount of each 
mutant polypeptide produced was determined by Western 
immunoblotting. Levels of altered GM-CSF in E. coli ex- 
tracts were found to vary over a 10-fold range. We do not 
know whether this variation reflects differences in expression 
or stability of individual polypeptides in this E. coli host. 
Quantitative immunoblotting allowed us to correct for dif- 
ferences in the amounts of protein found in the extracts. 
Samples containing each mutant protein at 36 ng/ml were 
assayed for their ability to stimulate the proliferation of the 
mouse myeloid cell line NFS-60. Results were expressed as 
specific activity relative to that of native mGM-CSF pro- 
duced and assayed under the same conditions (Table 1). The 
sensitivity of the assay was estimated to be 0.01% of wild- 
type activity. Any mutant protein exhibiting less than this 
activity was considered inactive, and the amino acid residues 
in a corresponding deletion mutant were referred to as critical 
to the activity of mGM-CSF. Mutants of >0.01% wild-type 
activity were considered active, and the deleted residues of 
such mutants were defined as noncritical. The term critical 
does not distinguish between those residues that are critical 
for the structural integrity of the polypeptide or those resi- 
dues that are involved in a direct functional role — e.g., 
receptor binding, activation, or both. 
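
The operational definition above amounts to a single threshold test. A minimal sketch follows, assuming that activities reported as below the detection limit are stored as `None`; the function name is hypothetical, and the example values are the Table 1 entries for four of the mutants.

```python
from typing import Optional

SENSITIVITY_PERCENT = 0.01  # assay detection limit, % of wild-type activity


def classify_deletion(percent_activity: Optional[float]) -> str:
    """Label deleted residues by the paper's operational definition.

    None stands for an activity reported as below the detection limit.
    """
    if percent_activity is None or percent_activity < SENSITIVITY_PERCENT:
        return "critical"
    return "noncritical"


# Activities (% of wild type) as listed in Table 1; None means "<0.01".
examples = {"V4": None, "V5": 0.8, "V13": 572.0, "V18": 0.07}
for mutant, activity in examples.items():
    print(mutant, classify_deletion(activity))
```

Under this rule a deletion that leaves only 0.07% of wild-type activity still counts as noncritical, which is why the text later separates such border mutants into their own low-activity group.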

Definition of Four Regions Critical to the Activity of mGM- 
CSF. Deletions in an initial set of 24 unique polypeptides 
defined four regions that were most critical to the activity of 
mGM-CSF (Table 1; V1-24). Additional deletion mutants
(Table 1; V25-42) were generated and tested for activity to
define more precisely the boundaries of the critical regions
identified in the first set of mutants. 

Critical region I. This region is defined by the mutants V4, 
-5, -25, -26, -27, -28, -41 and -42 and corresponds to amino 
acids Ala-18 to Ala-22. Mutants V5 and -28 cover amino acids
Leu-23 to Leu-26. Although these deletions result in a 
100-fold decrease of activity, Leu-23 to Leu-26 are consid- 
ered expendable according to our definition of a critical 
residue. Such regions are undoubtedly important for a fully 
functional GM-CSF polypeptide but do not constitute critical 
residues. 
Table 1. Activity and alterations of mutant mGM-CSF proteins

Mutant protein*   Alteration†                    % activity‡
V1                4-6                            343
V2                9-11                           789
V3                14-16                          70.3
V4                19-21                          <0.01
V5                24-26                          0.8
V6                29-31                          4.6
V7                34-36                          <0.01
V8                39-41                          <0.01
V9                44-46                          1.9
V10               48-50                          0.3
V11               54-56                          <0.01
V12               59-61                          <0.01
V13               64-66                          572
V14               69-71                          5.7
V15               74-76                          1.1
V16               79-81                          3.0
V17               83-84:86-87                    9.9
V18               89-91                          0.07
V19               94-96                          <0.01
V20               99-101                         <0.01
V21               104-106                        <0.01
V22               109-111                        <0.01
V23               114-116                        <0.01
V24               119-121                        167
V25               15-17                          5.6
V26               16-18                          <0.01
V27               17-19                          <0.01
V28               23-25                          0.7
V29               31-33                          3.7
V30               42-44                          0.1
V31               52-54                          <0.01
V32               61-63                          <0.01
V33               62-64                          0.03
V34               63-65                          0.03
V35               90-92                          0.02
V36               117-119                        1.5
V37               96-98                          <0.01
V38               102-104                        <0.01
V39               107-109                        <0.01
V40               112-114                        <0.01
V41               21-23                          <0.01
V42               22-24                          <0.01
VN19              2-19                           <0.01
VC94              94-124, Cys-85 → Ser           <0.01
VC116             116-124                        94.1
C118*             Cys-118 → Stop                 158
K119*             Lys-119 → Stop                 968
C51S              Cys-51 → Ser                   <0.01
C93S              Cys-93 → Ser                   <0.01
C118S             Cys-118 → Ser                  6.1
C51:93S           Cys-51 → Ser, Cys-93 → Ser     <0.01
C85:118S          Cys-85 → Ser, Cys-118 → Ser    129
mGM-CSF           Wild-type                      100

*V1 to V24 were the first series of deletion mutants tested.
†Numbers indicate residues deleted in deletion mutants or residue [...]
‡Activity is relative to that of wild-type mGM-CSF.


Critical region II. This region is defined by the mutants V7, 
-8, -29, and -30 and corresponds to amino acids Asn-34 to 
Ser-41. V7 and -8 are deletions interior to the critical region; 
V29 and -30 define the maximum N- and C-terminal borders, 
respectively. V29 has appreciable activity, ~3.7%, and thus 
it is possible that this critical region does not begin until
Glu-35 or Glu-36; however, it must contain Glu-36 as this is 
the most 3' residue contained in V7. Conversely, V30 has very 
low activity, <0.1%, and thus the C-terminal border is most 
likely defined by Ser-41. 

Critical region III. This region is defined by deletion mutants
V11, -12, -31, -32, -33, and -34 and comprises amino acids Val-52
to Gln-61. Both Vll and -31 are inactive and contain Thr-54, and 
thus this residue defines the minimum N-terminal boundary of 
this critical region. The use of additional three-amino acid 
deletion mutants at this junction is problematical, as they 
necessarily would include Cys-51 that was shown to be essential 
for function (see below). The mutants V32, -33, and -34 precisely 
define the C-terminal border of this critical region as Gln-61. In 
this overlapping series, V32 is inactive, but V33 and -34 have low 
activity (~0.03%). 

Critical region IV. This region is defined by the mutants
V19, -20, -21, -22, -23, -36, -37, -38, -39, -40, and VC116 and
comprises amino acids Glu-94 to Pro-115. It is the largest
critical region in the molecule, spanning 22 amino acids or
~18% of the mature mGM-CSF sequence. The N-terminal
border is defined by V19 and, as in critical region III, is 
flanked by a critical cysteine residue, Cys-93 (see below). In 
this case, the amino acids N-terminal to Cys-93 appear to be 
fairly important for activity, as both V18 and -35 have very 
low activity (0.07 and 0.02%, respectively). The C-terminal 
end is defined by either Ile-114 or Pro-115, as both of these 
residues are contained in V23, and Pro-115 is the final residue 
in the active VC116 protein product. 
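
The sizes of the four regions follow directly from their boundaries. The sketch below redoes that arithmetic, taking the boundaries as stated in the text (18-22, 34-41, 52-61, 94-115) and the mature length of 124 residues; the variable names are illustrative only.

```python
MATURE_LENGTH = 124  # residues in mature mGM-CSF, as stated in the text

# Inclusive residue boundaries of the critical regions, as given in the text.
CRITICAL_REGIONS = {
    "I": (18, 22),
    "II": (34, 41),
    "III": (52, 61),
    "IV": (94, 115),
}

# Inclusive ranges, so each size is end - start + 1.
sizes = {name: end - start + 1 for name, (start, end) in CRITICAL_REGIONS.items()}
total = sum(sizes.values())

for name, size in sizes.items():
    print(f"region {name}: {size} residues ({100 * size / MATURE_LENGTH:.1f}%)")
print(f"all regions: {total} residues ({100 * total / MATURE_LENGTH:.1f}%)")
```

With these boundaries region IV covers 22 residues, just under 18% of the sequence, and the four regions together cover about 36%, close to the 35% figure the paper reports. The small difference would disappear if region IV were taken to end at Ile-114, which the text leaves open.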

Fig. 2 is an illustration of the relative biological activity of 
all 42 deletion proteins vs. the location of their deletions. This 
representation reflects the relative contribution of each seg- 
ment of the molecule to the activity of intact GM-CSF, 
revealing both critical and noncritical regions. The four 
critical regions comprise 35% of the residues. The other 65% 
of the molecule can tolerate small deletions without complete 
loss of activity. It is important to note that, although 65% of 
the residues in GM-CSF seem noncritical, this does not imply 
that these regions can be deleted altogether; as is evident 
from Fig. 2, even small deletions, such as those generated 
here, often result in a substantial reduction in activity. Some 
mutant proteins have an activity that is considerably higher 
than that of native mGM-CSF. Deletions in these hyperactive
proteins seem to cluster at either terminus. At present we
have no explanation for this phenomenon.

Fig. 2. Relative activity vs. location of deletion. Each mutant
protein was assayed by its ability to drive NFS-60 proliferation. The
percent of its half-maximal response relative to wild-type mGM-CSF
is plotted against the location of the central amino acid of the
deletion. The sensitivity of the assay was estimated to be 0.01% of
wild-type activity; any mutant protein containing less than this
activity was scored as inactive.

The N and C Termini Are Not Important for Activity. N 
terminus. Four N-terminal deletion mutants, V1, -2, -3, and
-25, retain biological activity. All except V25, which lies at the
border of critical region I, have close to, or greater than, full
activity. V25 has appreciable activity (5.6%). These data
suggest that the N-terminal 16 amino acids are expendable,
and that Glu-17 is noncritical. The mutant VN19, which
deletes Pro-2 to Ile-19, was generated to mimic active
mutants of human GM-CSF (7). However, we found this mutant
protein to be inactive, consistent with our observation that it 
contained the two N-terminal residues of critical region I. 

C terminus. The mutant VC116, which deletes the C- 
terminal 9 amino acids, is fully active. As the number of 
deleted amino acid residues decreases, the activity of each 
successive molecule increases: C118* increases to 158%; 
K119*, encoding Ala-1 to Cys-118, increases to 968%. V24 
and -36 have deletions in the expendable C-terminal domain 
and both yield active molecules, but they have significantly 
less activity than the corresponding C-terminal "complete" 
deletion mutants (1.5% for V36 vs. 94% for VC116; 167% for 
V24 vs. 968% for K119*). 

Sequences Between Critical Regions Are Not Essential for 
Activity. Segments of GM-CSF that could tolerate small 
deletions without a total loss of activity were defined as 
noncritical. Three groups of active mutants can be identified: 
(i) those with activity equal to or greater than wild-type; (ii) those
having appreciable, but reduced activity (1-10%); and (iii) 
those of greatly diminished activity (<1%) (see Fig. 2). 

The first group is comprised of almost all N- and C-terminal 
mutants: V1, -2, and -3 at the N terminus, and V24, K119*,
C118*, and VC116 at the C terminus. Only mutant V13 at amino
acids 64-66, which falls immediately after critical region III,
results in a fully active molecule with an internal deletion. The 
intermediate class includes V5, -6, -9, -14, -15, -16, -17, -25, -28, 
-29, and -36 (V5 and -28 have close to 1% activity). Although 
clearly important to the activity of the molecule, the amino acids 
defined by these deletions are not critical. The last group 
includes V10, -18, -30, -33, -34, and -35. All fall on the borders 
of critical regions and have greatly reduced activities. Such 
mutant proteins are probably of low activity because they may 
perturb the integrity of the critical regions (see Discussion). 

The Disulfide Bridge from Cys-51 to Cys-93 Is Critical for 
Activity. Protease digestion analysis of h- and mGM-CSF (6) 
suggests that the four cysteines in GM-CSF pair 1-3 and 2-4. 
We generated mutant proteins that changed the four individ- 
ual cysteines to chemically similar serine residues, as well as 
double mutants that changed both cysteines of the proposed 
pairs to serines. The activities of each of these mutant 
proteins are summarized in Table 1. 

Changes of either or both Cys-51 and Cys-93 eliminated 
biological activity. These results show that the formation of 
this disulfide bond is absolutely required for an active mol- 
ecule. In contrast, changes of Cys-85 or Cys-118 resulted in 
active molecules, albeit with lower than wild-type activity. 
The double mutant C85:118S, in which both Cys-85 and 
Cys-118 were replaced by Ser, retained full activity. As one 
or both of these cysteines could be removed without elim- 
inating activity, the disulfide bridge from Cys-85 to Cys-118 
is evidently nonessential. The reduction of activity by an 
unpaired cysteine, as observed for mutant proteins C85S and 
C118S, in which Cys-85 was replaced by Ser and Cys-118 was 
replaced by Ser, respectively, may result from formation of 
inappropriate disulfide linkages. 

DISCUSSION 

The aim of our work was to delineate regions critical to the 
activity of mGM-CSF. Four regions of this nature, comprising
35% of the mature protein, were identified using systematic
incorporation of deletions into the wild-type sequence.
The decision to set the size of the deletions at three amino 
acids was based on the fine structural deletion map of the 
lymphokine mouse interleukin 2 (22). In this study, three 
regions critical to activity were identified by a variety of 
techniques. A hypothetical deletion analysis of mouse inter- 
leukin 2 with various deletion sizes revealed that all three 
critical regions could be identified using deletions of three 
amino acids incorporated as infrequently as every 10 amino 
acids. Larger deletions would require fewer mutants but 
present the risk of introducing gross structural changes 
causing inactivation independent of removing functionally 
important regions. Smaller deletions, although increasing the 
precision in delineating boundaries, begin to require large 
numbers of mutants to effectively locate all important re- 
gions. The series of deletions used here left not more than two 
adjacent amino acids unperturbed, making it unlikely that any 
critical region was overlooked.
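
The scanning design discussed above can be sketched as a simple window generator. This is an idealized version that assumes a first deletion at residues 4-6 (as in Table 1) and a fixed five-residue spacing; it ignores the two deletions the authors shifted to preserve cysteine residues, and the function names are hypothetical.

```python
def scanning_deletions(length, first=4, size=3, step=5):
    """Return inclusive (start, end) windows for a scanning-deletion series."""
    windows = []
    start = first
    while start + size - 1 <= length:
        windows.append((start, start + size - 1))
        start += step
    return windows


def longest_untouched_gap(windows):
    """Longest run of undeleted residues between consecutive windows."""
    return max(nxt[0] - cur[1] - 1 for cur, nxt in zip(windows, windows[1:]))


windows = scanning_deletions(124)  # mature mGM-CSF has 124 residues
print(len(windows), windows[0], windows[-1])
print("longest untouched gap:", longest_untouched_gap(windows))
```

With a three-residue window every five residues, at most two consecutive residues between neighbouring deletions stay untouched, which is the property the authors rely on when arguing that no critical region was missed. Widening `step` to 10 reproduces the sparser design considered in the interleukin 2 analysis at the cost of longer unprobed stretches.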

Identification of critical regions exposed by the deletion 
approach used here requires careful consideration. By de- 
leting any three-residue region, neighboring residues may be 
forced to occupy the space vacated by the missing residues. 
This transposition of residues may sometimes result in an 
inactive mutant as important residues are dislocated from 
their natural position. This type of secondary effect would 
result in the identification of these residues as being critical, 
even though they are not directly responsible for biological 
function. We cannot rule out this possibility. However, the 
removed residues are justifiably defined as critical since they 
are required for an active molecule. With respect to this 
issue, it is noteworthy that all highly defective mutant pro- 
teins (<1% activity) fall on the borders of critical regions. 
Presumably these proteins have severely reduced activities 
because they perturb the neighboring critical regions. 

To identify amino acid residues that are most important for 
activity, only those residues deleted in mutant proteins with 
undetectable activity were scored as critical. A consequence 
of this arbitrary definition was that deleted residues of some 
highly defective mutant proteins were considered noncritical. 
This does not imply that these residues are unimportant: it 
simply means that these residues are of lesser importance 
than critical residues. 

A synthetic mGM-CSF gene was used as the base template 
for generating all the mutants used in this study. Although it 
is not necessary for the methodology employed, we chose to 
make a synthetic gene because it facilitates examination 
through other mutational techniques. By identifying a limited 
number of regions important to an active molecule, it will be 
possible to focus on these areas using more intensive meth- 
ods such as saturation mutagenesis and the random replace- 
ment of specific amino acids. The presence of unique restric- 
tion sites available in a synthetic gene permits convenient 
access to these regions of interest. 

The nature of a critical region, whether it is functionally or 
structurally important, cannot be distinguished by the dele- 
tion method employed here. Extensive analysis of each 
critical segment will be required to determine their specific 
roles. However, some information can be obtained by a 
parallel analysis of the critical regions of m- and hGM-CSF. 
These proteins are similar molecules, both in primary sequence
(≈54% identity) and conformation (9). Yet the two
polypeptides are species specific. Comparison of the work 
described here and that of Clark-Lewis et al. (7) on synthetic 
peptide analogs of hGM-CSF shows that the two species 
share similar domains. The size and relative location of our 
critical region I coincide with that found in hGM-CSF. 
Similarities in amino acid composition around this region also 
imply that these respective critical regions are acting in a like 
manner. The strongly hydrophobic nature of this portion of 
the molecule as well as the observation that Trp-13 (of mouse)
is solvent-inaccessible (9) indicate the region is internal. This 
suggests that critical region I is structurally important. In 
apparent contradiction with our results, Clark-Lewis et al. (7) 
find that a hGM-CSF molecule extending to residue 96 (the third 
cysteine) retains residual activity; the corresponding mGM- 
CSF deletion mutant VC94 has no activity. This discrepancy 
could reflect a difference in the sensitivity of the assay. 

Gough et al. (8) have identified residues 11-15, 24-37, 
47-49, and 81-89 as required for generating a functional 
mGM-CSF. The present work and that of Zurawski and 
Zurawski (22) demonstrate the importance of correlating 
expression levels of mutant proteins with their corresponding 
activities. The inability of Gough et al. (8) to determine the 
mutant protein concentrations may explain why none of the 
critical regions identified in the present work coincide with 
their results; it is possible that insufficient expression of 
mutant proteins accounts for their negative assay results. 
Additionally, their use of large deletions to identify critical 
regions introduces the potential of gross structural changes 
that may have resulted in nonspecific inactivation of the 
mGM-CSF protein. 

Circular dichroism measurements (9) and structure predic- 
tions using the algorithm of Chou and Fasman (23) suggest 
that m- and hGM-CSF are composed of both α-helical and
β-sheet segments. We have compared the predicted locations
of secondary structural components and the critical regions
determined by this study. Interestingly, critical regions III
and IV are predicted to be β-sheet segments, the borders of
which coincide with the boundaries of these critical regions.
These regions are each flanked on the N-terminal side by a
cysteine residue involved in the formation of the critical
disulfide bridge: Cys-51 for critical region III and Cys-93 for
critical region IV (Fig. 3). Therefore, this disulfide bond
might play a direct role in the arrangement of these critical
regions with respect to one another. Since these regions are
predicted to be β-sheet segments, it suggests that critical
regions III and IV may form a parallel β-sheet s[...]

Fig. 3. Salient features of mGM-CSF. The four critical regions
are indicated by bars above the primary amino acid sequence. The 
essential disulfide bridge is shown by the heavy line between the two 
cysteines that border the N-terminal sides of critical regions III and 
IV. The other nonessential disulfide bridge is represented by a 
broken line. 



The analysis presented here begins to unravel the different 
structural and/or functional elements of GM-CSF. Identifi- 
cation of critical regions provides the opportunity to rapidly 
analyze the essential components of the protein. The knowl- 
edge gained from these studies would lead to a better func- 
tional understanding of this lymphokine. 